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Abstract 

In this paper we consider polynomial cointegrating relationships 
among stationary processes with long range dependence. We express 
the regression functions in terms of Hcrmitc polynomials and we con- 
sider a form of spectral regression around frequency zero. For these 
estimates, we establish consistency by means of a more general result 
on continuously averaged estimates of the spectral density matrix at 
frequency zero. 

Keywords and phrases: Nonlinear cointegration, Long memory, 
Hcrmite polynomials, Spectral regression, Diagram formula. 

AMS classification: Primary 62M15, Secondary 62M10, 60G10 



1 Introduction 



The extension of the standard cointegration paradigm to more general, frac- 
tional circumstances has drawn increasing attention in the time series lit- 
erature over the last decade. The possibility of fractional cointegratio n 
was already men tioned in the seminal paper bv lEngle and Granger! ((1987). 
Robinson! ()1994l ) was the first to establish consistency for narrow-band es- 
timates of fractional cointegrating relationships in the stationary case. The 

"This paper is part of the first author PhD dissertation. We are grateful to Franco 
Peracchi for useful comments on an earlier version. 
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properties of this estimator (which has become kn own as NBLS) were then 
invest i gated under nonstationary circumstanc e s bvlMarimicci and R obinson 
( 200lh . lR.obmson and Marinuccil ()200ll . l200.4 ). Ichen and Hurvichl (j2003al lhh 
considered principal components methods in the frequency domain, whereas 
VelascJlEnn^ . lRobinson and Hualdl l2003h advocate pseudo-maximum like- 
lihood methods which improve the efficiency of the estimates and yield stan- 
dard asymptotic properties. Cointe gration among s t ationary processes has 
also b een considered, for instance by Marinucci (200(3), Christensen and Nielsen 
21 



Many other insightful pape rs on fractional coint e gratio n have ap- 
peared in the literature, for instance Dolado and MarmoJ ( 2004 ). Davidson 
(|2002h . 

All these papers have focused on the case of linear cointegration. Nev- 
ertheless, the possibility of polynomial cointegrating relationships seems of 
practical interest, for instance (but not exclusively) for applications to fi- 
nancial data. Non linear cointegration has been considered in the literature 
(most recently by Karlsen. Mvklebust. and Tiostheim ( 2005^ 1 ). but only in 
non-fractional circumstances, to the best of our knowledge. In this paper, 
we shall focus on nonlinear cointegrating relationships among stationary 
long memory processes; the restriction to a stationarity framework is made 
necessary by the need to exploit the rich machinery of expansions into Her- 
mite polynomials, an extrem ely powerful tool to investigat e nonlinea r trans 



formations (see for instance iGiraitis and Surgaihi 



Arconesl (|l994h . 



Surgailisl (feoO.'j )). Our general setting can be explained as follows. Let 
{A t } = {xt,et}, t £ Z be a stationary bivariate time series with mean zero 
and covariance such that 

EA t Ai 



t+T 



r(r) 



2tt 



f(\y rX d\ 



/(A) 



where 

/ex (A) 

is the spectral density matrix of {^4t}. 
memory, in the sense that 



/xe(A) 
feeW 

We shall take {xt,et} to be long 



(1) 



for a, b = x, e , < d a , dj, < 5, G xx , G ee > 0, \G xe \ > 0. We write z ~ I{d z ) 
for long memory processes with memory parameter d z , and ~ to denote that 
the ratio of the left- and right-hand sides tends to 1. 

Now assume there is a polynomial function g(-) such that E[g(x<)] = 

and 

Vt = g(xt) + e t , 0<d e < d y <d x < 1/2 ; (2) 
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in this case, we say that yt,Xt are nonlinearly cointegrated. Clearly, the 
standard (stationary) fractional cointegrating relationship is obtained in the 
special case where g(-) is a linear function. 

Our main idea in this paper is to write <?(•) as a sum of Hermite poly- 
nomials; the coefficients of these p olynomial s will b e estimated by m eans 
of a spectral regression method , as iRobinsonl Jl994h . iMarinuccil feooch and 



Marinucci and Robinson! ([20011 ). We shall show that, by using a degener- 



ating band of frequencies around the origin, then the estimator of these 
coefficients is consistent, even if xt and e< are allowed to be correlated. The 
plan of this paper is as follows; in Section 2 we review some results on 
long memory processes, Hermite polynomials and the diagram formula; in 
Section 3 we discuss consistent estimation of nonlinear cointegrating rela- 
tionships, whereas in Section 4 we collect some comments and directions for 
future research. Some technical results are collected in an Appendix. In the 
sequel, C denotes a generic, positive, finite constant, which need not to be 
the same all the time it is used; for two generic matrices A and B, of equal 
dimension, we say that A ~ B if, for each the ratio of the (i,j)-th 

elements of A and B tends to unity. 



2 Nonlinear transformation of long memory pro- 
cess 

It is well-known that, under regularity conditions, a consistent estimator of 
the spec t ral d ensity matrix at frequency zero is given by (see for instance 
Harmaul ()l97nh . p. 246) 



/ a6 (0)= ^ K M (X)I ab (X)dX (3) 

where the kernel Km{X) is symmetric and such that J^Km(X)(1\ = 1, and 
M is a positive integer satisfying the bandwidth condition 1/M + M/n — > 0. 
Here, I a b(^) is the periodogram, that is 

^ n— 1 



-71+1 



for 



n 1 J2t=i a tbt+r r > 

n ~ 1 J2t=\r\ + l a t b t~\r\ T<0 



cab(r) = { __ 1 ^*r 1 Jl , ot,6 t €R,t=l J 2... J n . 
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Equation (jSJ) can be rewritten as 



/oft(O) 



1 

2^ 



n-l 

£ 

—n+l 



if M (A)exp(iTA)(2A . (4) 



As usual, we call the function /cm(") a lag window, and the corresponding 
estimator Q the lag window spectral density estimator. The asymptotic be- 
haviour of fab(fi) under short rang e dependence conditions is now s tandard 
textbook material, see for instance 



Taniguchi and KakizawfJ (Eoflflf) . Under 
long memory circumstances, iRobinsonl ( 1994 ) and Lobato ()l997l ) investi- 



gated the beh aviour of, respect ively, a discrete univariate and multivariate 
version of (jSJ); IRobinsonl (J199J) propose an application of this statistic for 
the estimation of the cointegrating vector in a stationary framework. The 
behaviour of the lag window estimator under long memory is discussed by 
Marinuccil (boooh . 

In the latter reference, the following linear cointegrating relationship is 
considered: 



lit 



Pxt + et, < d e < d y = d x < 1/2 , 



and it is shown that (3 is consistently estimated by 



0. 



M 



jZ n K M (X)I ab (X)dX Yin 



-n+l 



^M(r)c a& (r) 



Er=-n+l k M (T)c bb (r) 



f^K M (X)I bb (X)d\ 

under the bandwidth condition M 2 = o(n) as n — > oo. We consider here the 
same procedure but under generalized, polynomial circumstances. The in- 
vestigation of nonlinear transformations requires the computation of the cu- 
mulants of the Hermite polynomials; this is usually achieved by means of the 
so-called diagram formula (see for instance lArconesI (|1994h ). We introduce 
now the main ideas behind this approach. Consider the polynomia l tran s- 
formation of the Gaussian process (see also Dittmann and Granger ((20021 )) 



K K 
g(zt) = a k z t = Y hH k (z t ) , k > 1 , t = 1,2, 

k=ko k=kg 



n 



(5) 



where 



bk 



E(g(z)H k (z)) 
k\a 2k 



and H k (.) denote the well-known Hermite polynomials, which form a com- 
plete orthogonal system in the space L 2 (m. 1 , (\/27r<7 2 ) -1 exp(— z 2 /2a 2 ^j of 
square integrable functions of Gaussian variables. These polynomials are 
defined through the formula: 



(- 



-l)V J exp(^)|-exp(-^) 



1,2,... 
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Straightforward computation show that the first five polynomials are: 

H (z) ~ 1 D--'-^-- n-./--\_-2 ,2 h-./-^ _ „3 o„_2 



1 , fli(z) = 2 , fl" 3 («) = - (T 2 , fl" 3 («) = Z J - 3^(7^ 



# 4 (z) 



2 4 _ 6z2fj2 + 3 4 



10zV + 15za 4 



The Hermite polynomials satisfy the differential equation kH k _i{z) = dH^z) / 'dz, 
under the boundary conditions E[H m (Z)] = 0, where Z is a zero mean Gaus- 
sian variable with variance a 2 . It is also well-known that, for any mean zero 
Gaussian random variables v and u, we have: 



E[H p (u)H q {v)} 



pi [¥,(uv)] p for p = q 
for p ^ q 



(6) 



The index of the first non null coefficient b k is termed Hermite rank of 
<?(.). Of course, yt is non-Gaussian unless g(-) is linear. 

For our aims, the most important property of Hermite polynomials is 
their orthogonality. This property allows us to characterize in a simple 
way the dependence structure of a nonlinear transformation of a stationary 
Gaussian process tha t exhib i t long range dep endence. The following; result s 
are essentially due to Taaau ( 19751 . 1979h and Dobrushin and Mai or ( 1979^ . 
The non-Gaussian case is m ore compl i cated and some results using Appell 
polynomials are provided by Surgailisl ([2OO0I ), Let z t ~ I(d z ); in view of £Q) 
and © it is easy to see that 



E[H k (z )H k (z T )] = kl~f k (r) ~ Cr k ^ 



-i) 



as t 



so the sequence H k+ i(z t ) is "less" dependent then H k (zt). More precisely, 
if zt ~ I(d z ), then H k (zt) can be viewed as a long memory series that is 
fractional integrated of order d k , 



d 



k ■- 



k d 



The above equation follows straightforwardly from the equality 2d k — 1 = 
k(2d z — 1). The fact that a nonlinear transformation of a Gaussian process 
with long range dependence cannot increase its memory can be used to 
determine the leading term of the expansion ©. For instance, assume for 
notational simplicity that z% has unit variance. From © 



E[H k (z t )H k {z t+T )] = kl 7 k z ( T ) = k\ 



'r k \x)d\ 



where 



-^fc-i)/z(^i) • • • f z (uj k ^ 1 )duj 1 . . . du) k -x 
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is the k fold convolution of f z {X) with himself (IHannan (|l97oh . lDaHa. Giratis. and Hirialgnl 



(2004)). The convolution is defined extending /(.) periodically outside of 



-7r, 7r]. We now have, with an obvious notation: 

K 



f 9 (x)=^2b 2 k k\f(* k H\) 



k=kn 



Therefore, also in the frequency domain, the rank of the transformation 
determines the feature of the spectral density, and for A — > 0, the memory 
of g(zt). 

Let us now introduce diagrams, which are mnemonic devices for compu- 
tation of moments and cumulants of polynomial f orms in Gauss i an ra ndom 
variables. Our presentation follows Arconesl ( 1994h and Surgailisl ( 2003^ . Let 



p and £j , j = 1, . . . ,p, be given integers. A diagram G of order (£\, . . . £ p ) is 
a set of points {(j,£)) ■ 1 < j < p; 1 < £ < £j} called vertexes, and a set of 
pairs of these points 

{({j,£),(k,s)) :l< j <k<p;l< £<£„!< s <£ s } , 

called edges. Every vertex is of degree one, that is, it is considered one time 
for each graph. We denote by V(£i, ■ ■ ■ ,£ p ) the set of diagrams of order 
(£i, ■ ■ ■ ,£ p ). The set is empty if £\ + • • • + £ p is an odd number. The set 
Lj = {(j,f) : 1 < I < £j} is called the jth level of G. We will denote the 
set of edges of diagram G by £(G). Observe that edges connect vertexes 
of different levels (no flat edges). A diagram G is said to be connected if 
the rows of the table cannot be divided in two groups, each of which is 
partitioned by the diagram separately. In other words , G is connected if 
one cannot find a partition P 1 \J P 2 = {1, . . . ,p}, Pi n P 2 = 0, Pi,P 2 + 0, 
such that, for £(G) = (gi, . . . gk), either gi G Ujgp^j or gi € Uj € p 2 Lj holds, 
for i = 1, . . . r, where r is the number of edges gi of the diagram G. The set 
of connected diagrams are indicated by V c (£i, . . . , £ p ). The main instrument 
we shall need below is the following, well known: 

Diagram Formula: Let (z\, . . . z p ) be a centered Gaussian vector, and let 
7ij = K(ziZj), i,j = 1, . . . ,p. Let L be a table consisting of p rows £\, . . .£ p , 
where lj is the order of Hermite polynomial in the variable Zj. Then 

4rW;>) = E n tS" 

ti=i J Gev(i 1 ,...,e p )i<i<j<p 

cum(H ei ( Zl ),...,H ep (z p )) £ II it; 3 

Gev c (£i,...,e p ) i<i<j<p 
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where, for each Gaussian diagram, aij is the number of edges between rows 
£i,£j and cum (1/4(2:1), Hi p (z p )) represents the p— th order cumulant. 

Examples of diagrams are represented in Figures 1 to 7 in the Appendix. 

3 Nonlinear cointegration 

We state here more precisely our full set of assumptions. 
Assumption A 

1) The following equation holds: 

Vt = g{x t ) + e t , (7) 

where for t = 1,2, ... 

g(%t) = Ef=fc a k x t = EfLfco b kH k (x t ) , b ko + o , 

* = Et~ k0 hi = EfU, t k H~ k {e t ) , t k0 + , 

2) (xt,£t)' are jointly Gaussian and long memory, that is, as r — ► oo 

1xx{t) ~ G xx T 2dx ~ l , < G xx < oo 
7££ (r) ~ G ee r 2d *-\ < G £e < oo 
1 X s(t) ~ G xs T d * +d *-\ \G X£ \ < oo 

for < d e , d x < 5 . 

3) The parameters K, ko are such that 

K(2d x -1) > {-lVfc (24-l)} • 

Assumptions A1-A2 identify a polynomial cointegration model where 
the residual is a Gaussian subordinated process. Assumption A3 ensures 
that Hx(xt) is still a long memory process, with stronger memory than et. 
This is needed for consistency and indeed it is also a necessary identification 
condition: recall x± and et can be correlated, so there are no means to distin- 
guish H k (xt) and et unless the former has stronger long range dependence. 
Recall that H k (xt) ~ I(d k ) , k = ko, K, where 2d k — 1 := k(2d x — 1). In 
this paper, we take ko and K to be known, whereas their estimation will 
be addressed in a different work. Note that to implement our estimates we 
need no a priori information on ko, K, although the value of ko(2d e — 1) does 
affect the rate of consistency of our estimators. 
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As mentioned before, (j2J is a cointegrating relation, so we allow K(xt£t) 
(and hence E(xtet)) to be different from zero. As for linear cointegration, 
this leads to the inconsistency of OLS and justifies the use of the spectral 
regression methods for the estimation of Hermite coefficients. Concerning 
the kernel, we write ku (•) = k(r/M) and introduce the following 

Assumption B: The kernel k(-) is a real-valued, symmetric Lebesgue 
measurable function that, for v € R, satisfies 




k(v)dv = 1 < k(v) < oo, k(v) = for > 1. 



Our final assumption is a standard bandwidth condition. 



Assumption C: Let rj = K V ko; as n — > oo , 

1 M 3 ^" 2 ) 



Assumption C imposes a minimal lower bound and a significant upper 
bound on the behaviour of the user-chosen bandwidth parameter M. The 
need for this bandwidth condition is made clear by inspection of the proof 
in the appendix; heuristically, as K grows the signal in Hx(xt) decreases, 
which makes the estimation harder; on the other hand an increase in ko 
makes the convergence rates in Lemma 1 and Theorem 1 faster, whence the 
need for tighter bandwidth conditions. We are not claiming Assumption C 
is sharp, however an inspection of the Proof of Lemma 1 reveals that any 
improvement is likely to require at least almost unmanageable computations. 

Equation can be rewritten more compactly as 



Vt 



P'H(x t ) + e t , where H(x t ) = ...,H K {x t )}' 



Let us now define: 
/ii(A) 

fHH(X) 













/22(A) 



Skk{\) 



fHeW 



fleW 
/2e(A) 

fKe(X) 



8 



and let also, for a, b = 1, 2, ... K. 

lab (r) = E [H a (x t )H b (x t+T )] = a\5 b a {E (x t x t+T )} a , 

K 

7ae (r) = E[H a {x t )e t+T ] = E H a {x t ) ^H % [e t ) 

k=ko 

( a!ME(x^ +r )} a for a < A" 
, otherwise 

where represents the Kronecker delta function. Likewise 



7aa(r)e 



-i\r 



-oo 

OO 



/ aa (A) : =(2vr)- 1 
/ as/ (A) : =(2vr)- 1 / Ta^e^ , 7atf (r) := E [H a (x t )y t+T ] 

J — oo 

/oo 
7ae(r)e- aT • 
-oo 

The Weighted Covariance Estimator (WCE) of ft = (ft, ... , (3k) is defined 

@M = /ffH(0)-7H»(0) , 



as 



whence 

PM-f3 = fHH(0y 1 f He (0) ; 
as usual, we assume Jhh{0) is non-singular, where 



WO) = - 



Er=-MHr/M) Cll (r) 



Er=-MHr/M)c 1K (r) 



Er=-MHr/M)c Kl (r) ■■■ Z:L_ M k(T/M)c KK (T) 



M 



Er=-MHr/M)c lz (r) 



and 



c a z{r) 



n z2t=i H a(x t )H b (x t+T ) 

n ~ l Efc=| T |+l H a{x t )H b {x t _\ T \ 

n ~ l YJt=l H a (x t )z t+ r 
n ~ l Y.t=\T\+l H a{x t )z t _\ T \ 



T > 

r < 

r > 

r < 
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for a,b = 1,2, ... K , z = e,y. The followi ng lemma i s the m ain tool for our 



consistency result, compare Lemma 1 in iMarinuccil (J2 
write 

k 



As before, we 



d a ■= a (d x - + i , d e = 
by Assumption A3 we have d a > 0, a 



d,- 1 - 

£ 2 



+ 5 >V0; 



LEMMA 1 Under Assumptions A-C, as n — > oo we have: 

M 



r=-M 
M 



(8) 
(9) 



£ k {c ae (r) - lae {r)} = o p (M d ^) 

T=-M 

for a, 6 = 1,2, ...K 
Proof See Appendix 

We are now ready to state the main result of this paper. Let 
B ab : = a\G a xx 5 b a J k{v)\v\< 2d ^ dv < oo , 

B ae : = a\i a {G X£ } a J ^ k{v)\v\< d * +d ^ dv < oo , for a < K , 
see also Assumption B, a,b = ko, K. Let 

B HH = tiagiBn, . . . Bkk} , B He = {B u , -,B Ke } , M = diag |M" dl , . . . M 

Note that B ae = unless a < K, due to the orthogonality of Hermite 
polynomials. 

Theorem 1 Under the Assumptions A-C, as n — > oo 
M di-d e o 







hi -P)= Bj&BHe + Op(l) 



M dK - d " 
Proof By the dominated convergence theorem, as M — > oo 

T=-M T=-M 
T = -M T=-M 
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From Lemma 1, it follows easily that 



Wo) 



Ci + o p (M Ml ) o p {M dl+d2 ) 
o p (M d2+dl ) C2 + o p (M 2d2 ) 



o v {M dK 



+di 



o p (M dl+d f) 
o p (M d2+d p) 

C K + o p {M 2d «) 



where 



Moreover 



1 M 

r=-M 



Mf HH (P)M = 
Therefore, for M —> 00 



B n + o p (l) 
o P (l) 



Op(l) 

+ o p (l) 



W°) = M~ BhhM~ + op(l) = £WW~ 2 + o p (l) , 

since Sff/f is diagonal and hence commutes with M . Using the same 
arguments, it follows easily that: 

M- d ^Mf he (0) B He , as n 00 . 

Finally, as n — > 00 , 

M-^M^ 1 {/3 M - /?} = {MA„(0)Mp MM^i(O) - B^B He > 
which completes the proof of Theorem 1. 

□ 

Remark In Theorem 1 we have proved the consistency of the WCE estima- 
tor of the cointegrating vector, (3m —> (3- In a very loose sense, this result 
follows from consistency of a continuously averaged estimate of the spectral 
density at frequency zero, see Lemma 1. It is also possible to use Lemma 
1 to derive a robust estimate for the memory parameter of an observed, 
Gaussian subordinated series wt := g(xt), (ko(d x — 5) + \ =■ d w , say). We 
use a very simil ar idea to the averaged periodogram estimate advocated by 
Robinsonl (|1994f ). More precisely, with an obvious notation we can consider 



loe 



v-vM 



k{ j/r )c 1i)UI (t) 



2 log M 



log B ww 
dw+ 21oiM + ° p(1) ' 



d w + o p (l) 
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where we have used Lemma 1. This estimate converges at a mere logarith- 
mic rate and it is not asymptotically centered around zero; it is however 
consistent under much broader circumstances than usua lly allowed for in 
the literature. See also Dalla. Giratis. and Hidalgo! (2004) for very general 



results on consistency for long memory estimates. 

4 Comments and conclusions 

We view this paper as a first step in a new research direction, and as such 
we are well aware that it leaves several questions unresolved and open for 
future research. A first issue relates to the choice of the Hermite rank ko and 
of K. As far as the former is concerned, we remark that for the great ma- 
jority of practical applications, ko can be taken a priori as 1 or 2. Under the 
assumption that ko = 1, the equality d x = d y holds; this trivial observation 
immediately suggests a naive test for ko = 1, which can be simply imple- 
mented by testing for equality of the two memory parameters. It should be 
noted, however, that when x t and yt are cointegrated the standa rd asymp 



totic r esults on multivariate long memory estimation (for instance iRobinson 
do not hold. Incidentally, we note that the nonlinear framework al- 
lows to cover the possibility of cointegration among time series with different 
integration orders, a significant extension over the standard paradigm. 
For K, we can take as an identifying assumption 

K := argmax(fc : k(2d x - 1) > {2d e - 1)) ; (10) 

higher order terms can be thought of as included by definition in the resid- 
uals, to make identification possible. Indeed, it is natural to suggest to 
view g(.) as a general nonlinear function and envisage K as growing with 
n; we expect, however, that only the projection coefficients with k sat- 
isfying (|10j) could be consistently estimated in this broader framework. On 
the other hand, we note that the it is also possible to estimate consistently 
K* < K regression coefficients, by simply dropping the higher order regres- 
sors: it is immediate to see that their inclusion in the residual would not 
alter any of our asymptotic result (there may be an effect in finite samples, 
however). We stress that a lower number of regressors allows in general a 
weaker bandwidth condition, see Assumption C. 

The extension to multivariate regressors does not seem to pose any new 
theoretical problem: multivariate generalizations of Hermite expansions are 
well known to the literature. Of course, much more challenging seems to 
be the possibility to allow for multiple cointegrating relationships. An im- 
portant point to remark is the following. In standard cointegration theory, 
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the role of the variables on the left and on on the right-hand sides is, by all 
means, symmetric: this is no longer the case when nonlinear relationships 
are allowed. In particular, it should be noted that the memory parameter of 
the dependent variable yt is always smaller or equal than d x \ this informa- 
tion can be exploited in an obvious way to decide the form of the regression, 
provided that first step estimates of the long memory parameters are avail- 
able. We also remark that our procedure requires a preliminary knowledge 
on the variance of the regressor xt] such knowledge can clearly be derived 
from first step estimates, and we leave for future research the analysis of its 
consequences in finite samples. 

In this paper, we restricted ourselves to consistency results, and gave no 
hint on asymptotic distributions. The latter are likely to be non-Gaussian, 
at least if the Hermite rank is larger than one and/or the memory of the 
raw series i s such to make their autocov ariances not square summable (see 
for instance Fox and Taaau ( 19851 . 1986h . A much wider issue relates to the 



possible extension to nonstationary circumstances. Here, a major technical 
difficulty arises: the higher order terms in Hermite expansions need no longer 
be of smaller order in the presence of nonstationarity. We believe, however, 
that the stationary framework considered in this p aper is of sufficient inter- 
est by itself for applications to real data, see again IChristensen and Nielsen 
([20051 ) for examples on how fractional cointegration among stationary vari- 
ables may be implied by some models of volatility, based on the Black- 
Scholes formula for option pricing. 



Appendix 



Proof of Lemma 1 Recall we have 

7afe(r) ~ G\r\ da+db ~ 1 as r -»■ oo , 

where for a, b = 1, K, d a is such that 

= f §(2d x -l) + i for a{2d x -l)>-l 

a ' { for a(2d x - 1) < -1 

The first part of the proof follows closely Marinuccl (|200ol ). For 
sufficient to show that 

C M-\ 
Var I 



(11) 
it is 



E Ki) Cab(T) 



M-l 



-M+l 



E 



Cafe(r) 



1 



T 



n 



7ab(r) 



(t=-M+1 
M M 

< C E E \Cov{c ab (p),c ab (q)}\=o(M 2d ^) 
p=-M q=—M 
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From 



Hannan ( 197f)l ). p. 210 we have: 



Cov{c a b(p),Cab(q)} 



1 ^ / | r |\ 

- E 1 ) {laa{r)lbb{r + q-p) +lab{r + q)lba(r-p)} {12) 

n \ n J 

^ n— 1 n— r 

H — 2 E E cum «^ (s, s + p, s + r, s + r + g) , (13) 



r=— n+1 s=l— r 



where 



cum aW (s,s + p, s + r, s + r + g) = cum{il (z s ), i2&(x a+p ), # a (x s+r ), # fe (x s+r .+q)} 

Likewise, for we shall show that 

{M-i ^ r Af-i 

T=-M+l 



T 

M 



,{p) =E ^ fc 



T 

M 



c ae (r) - (1 ) 7ae(r) 



Lt=-M+1 

M M 

< c E E i^{ Cae (p),c ae (g)}i = (M 2 ^ +2 ^) 

p=-M q=—M 



where 



Cov{c ae (p),C ae (q)} 



I ^ / | r |\ 

- E 1 {laa{rhee(r + q-p) +Jae(r + qhea(r-p)} (14) 

r=— n+1 v 7 
- n— 1 n— r 

H — 2 E E cnm aeae(s,s+p,s + r,s + r + q) , (15) 



r=— n+1 s=l— r 



and 



cuni aeae (s, s +p, s + r, s + r + g) 
= cum {H a (x s ), e s+p , H a (x s+r ),e s+r+q } 
k k 

= E E cnm { H a(x s ) 1 H k (e s+p )H a (x s+r ),H k/ ,(e s+r+q )} . 

k=kQ k'=ko 

The argument for (|12j) and (|14[) is the same. For instance, for (|12|) we have 
M A/ 1 n-1 . , ,v 

E E - E i 1 - V) {7 - (r)7f * (r+{? - p)} 

=-A/ q=-M r =-n+l ^ 7 



< 



p=_M g 

,M 



2Af 



C— E E (H + l^^dr + rl + l) 2 *- 1 ^- 



r=-2Af \\r\<2M 
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+ E (H + lJ^^dr + rl + l) 2 *- 1 
r 

/ 2M \ 

E (M + l) Mo ~ 1 E (k + rl + 1) 2 *" 1 

|<2M V r=-2Af / 



|r|>2M 

M 



2M / 

+ E E (H + l^-'dr + rl + l) 2 *- 1 

r=-2M \2M<|r|<n 

= 0{Mn- x M 2d °-M 2d ») + O (M 2 n~ 1 n 2da+2d b ~ 1 ) = (M 2da+2db ) . 

As usual, summations over empty sets are taken to be equal to zero. For 
the second term we have: 



M M 

p=-M q=-M 
M M 



n-1 



E i 



r=— n+1 
n-1 



< 



c E E \ E (i-^)^|7^+p)+7 2 >-,)| 



* - E 



p=-M q=-M r=— n+1 

A/ M 

^ (| r +p | + 1 )2*,+2*-2 + ^2 (|r - ?| + l) 2d «+ 2 ^- 

p=-M q=-M 



\r\<2M 

M 2 



+C— E [(|r+p| + l) 2 ^ +2 ^- 2 + (|r-g| + l) 2 ^ +2d ^ 2 

2A/<|r|<n 

0(Mn- 1 M Ma+2a! '') + 0(M 2 n- 1 n Ma+2dfc - 1 ) = o(M 2da+M ") . 



The orders of magnitude of the cumulants are investigated by means of the 
diagram formula. The proof is quite tedious. We shall focus on (|15j) as the 
argument for (|13|) is entirely analogous. From the diagram formula it follows 
easily that, for any finite k, k' > ko 

k k 

E E \ cum { H a(x s ), H k (e s+P ), H a (x s+r ), H k i(e s+r+q )}\ 

k=kg k'=ko 



< C cum ^H a (x s ),Hj io (e s+p ),H a (x s+r ),H^ o (£ s+r+q )^ 



(16) 



Indeed, increasing the value of ko to k, k' entails including more products of 
covariances in the cumulant, and these covariances are bounded. In order 
to simplify the presentation, we divide it in three parts, that is 

1) a = 1, ko > 2 or a > 2, ko = 1 

2) a = 2, k > 2 or a > 2, k = 2 
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3) a, ho > 3. 

Throughout the proof, we shall assume for brevity's sake ko(2d e — 1) > 
— 1; it is simple to check that for ko(2d £ — 1) < —1 the proof is analogous, 
indeed slightly simpler. 



Part I: a = 1, k > 2 or a > 2, k = 1 

For a = 1, ko = 2 we have 

M M 



E E ■^\cvLm{x s ,H 2 (e s+p ),x s+r ,H 2 (e s+r+q )}\ 



p=—M q=-AI 
M M q 

p=-M q=-M 



n—1 ri—r 



E E lxe(p)lxe(qhee(r + q - p) 
r=—n+l s=l—r 

+lee{r + q~ phex{r - p)~/ X£ (r + q)\ 
q M M 



< - E (H + l)^ +d£_1 E (M + l)*""^" 1 E {\r + q-p\ + l? d "- 1 



p=-M 



q=-M 



\r <3M 



+ E (k + 'z-pl + i) 2 ^- 1 

3M<|r|<n 



+^E E E (\r+q-p\ + l) 2d '-H\r-p\ + l) d * +d '-H\r+q\ + iy 



:+d £ -l 



p=-Mq=-M \\r\<3M 



+ E {\r + q-p\+l? ds - 1 {\r-p\ + l) dx+d ^ 1 {\r + q\ + l) d * +d ^ 1 
3M<\r\<n 

0{n- l M d * +d ^M dx+d ^M 2d t) + (9(n- 1 M 2ci:c+Me n Me ) 
+0(n~ 1 MM dx+ds M 2d ^) + O^MV^^" 3 ) 

4^+2^ _,_ n A ^+3(^-1 ^ _|_ ^4d £ +2d a; 

o(M M - +4d£ - 1 ) = o(M M * +Me ) because 2d e = 4d e - 1 . 



( Mj^+^-i^ + (M 4de+2 ^) + O ( —M^ 3 ^- 1 
n J \ n 



The extension to k$ > 2 is trivial: 

H-^^Eg+p), X s -\- r , i?^ o (e s -|_ r _)_g) j- 



cum 

M M 

E E 

p=-M q=-M 



c 



n- 



n— 1 n— r 



r=— n+1 s=l—r 
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+%e 1 (r + q- p)jex (r - phxe (r + 

0(n" 1 M Ma:+2a!£ M ( ~ fc o- 1 )( 2 ^- 1 )+ 1 ) + Oin' 1 M M * +M£ n ( ~ fc o _1)(2a!e ~ 1)+1 ) 
+0(n" 1 M 4+4+1 M ( ' 4 ')" 1)(2 ' i£ " 1)+1 ) + OirT 1 M 2 n M * +Ms_2+ ^° _1)(Me_1)+1 ) 

+ ^^M d;c+ci£ ~ 1 M ( ~ fc o~ 1)(M£ " 1)+1 ^ + O^^MV^^ 2 ^" 1 )) 
o(M 5 



n 

2d x +2d e 



by the same argument as before. The proof for a > 2, ko = 1 is entirely 
analogous and hence omitted. 



Part II: a = 2, k Q > 2 or a > 2, k 

For a = 2, fco = 2 we have 



M M 



p=-M q=~M 
M M 

£ E E 

p=-M q=~M 



n—1 n—r 



E E CUm{i?2(x s )F 2 (e S +p)-H'2(2; S +r)-ff2(e S +r+g)} 



r=— n+l s=l— r 
n—1 n—r 



=— n+l s=l— r 
+7:r e (»7x e + g)7ex(r- - p)7x £ (g) 

+7xx(^)7e£(^ + 9 - P)lex{r - p)<y X£ {r + < 



M M 



da;+rf £ — 1/ 



(| r | + l)2<fe-l(| r + g _p| + 1 ) 



2d e -l 



<J EE E [(bi+i^-Hki+i) 

{p=-M q=-M \r\<3M 

+ (|p| + l)^ + ^- 1 (|g| + l)^ + ^- 1 (|r + g| + l)^ +d -'- 1 (|r - p\ + \) d *+ d ^ 
+ (|r| + l) 2d -- 1 (|r+g-p| + l) 2rfE - 1 (|r-p| + l) <fe+de - 1 (|r+g| + l) da:+de - 1 ] } 

„ ( M M 

+- E E E [(N+i^-^kl+i^^-^kl+i^-Hk+^-Pl + i) 

^p=-M q=-M 3M<r<n 

+(N+i)^ +d£ " 1 (kl+i)^ +d£ " 1 (k+g|+i) da;+d£ " 1 (|r-p|+i)^ +d£ - 1 

+ (|r| + l) 2 ^" 1 (|r + ( Z -p| + l) 2 ^- 1 (|r-p| + l)^ + ^- 1 (|r + ( 7 | + l)^ + ^- 1 ]} 

= 0(n _1 M M:t;+2dE M Me ) + 0(n- 1 M dx+ds M d * +ds M d * +ds ) + O^M 3 ^ 4 " 3 '**) 

+0(n- 1 M 2d - +2 ^n 2d - +M ^ 1 ) + 0(rT 1 M 2d " +2d 'n 2d ' +2d '- 1 ) + 0(M 2 n-V d * +4(fe - 3 ) 



2d e -l 
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I M—M 2dx+Ade ~ 2 ^ +0 ( —M 2dx+Me ~ 2 
n J V n 



+ o{M MM ^ 2 ) 
o(M 2d2+2ds ) because 2d 2 = 4d x - 1 and 2d e = M £ - 1 . 



For ko > 2 the argument is very much the same: 



M M 



p=—M q=-M 
M M 

s E E 



n— 1 n— r 



^ cum{H 2 (x s )H~ ko (e s+p )H 2 (x s+r )H~ ko (e s+r+q )} 



n 



r=— n+1 s=l— r 
n—1 n—r 



p=—Mq=—M " r=—n+ls=l—r 

+lxe(p)lxe(q)jxe{r + q)lex{r - p)^~ 2 (r + q-p) 

+lxx{r)lex{r -p)lxe{r + ghe^V + Q ~ p) 

= 0(n _1 M M:E+M£ M^o" 1)(Me ~ 1)+1 ) + 0( n - 1 M M;c+Me M ( ~ fc o- 2 )( 2 ^- 1 )+ 1 ) 

+0(n~ 1 M 2 ^M d " +d£ M ( ~ fc o- 1 )( 2 ^- 1 )+ 1 ) +( 9(- n -ijvf 2 ^+ 2 ^n 2d:c " 1+( ~ A: o- 1 )( 2 ^- 1 )+ 1 ) 

+ Q/ n -lj W -2dx+2d erl 2 ( J x +2d £ -l+(fe -2)(2d e -l)N _j_ Q/ M 2 n -l n 4d x +2d s -2+(h -l){2d e -l)j 

= o ( M M 2dm ~ 1 M* o(2de ~ 1)+1 \ + o ( M 2dx+2de ~ 1+ ^ o(2de ~ 1)+1 
\ n J \ n 

+ M 3t k~ 1 ~ t kMM 2de ~ 1)+ A +0 f — m 4 ^~ 1 M* o(2 '* s ~ 1 ) +1 
\ n J \ n 

= o(M 2d2+2de ) , because 2d e = k (2d e - 1) + 1 . 
Part III: a > 3, k > 3 

We note that, by the diagram formula (as in Qlfijl) 



cum 



H a (x s )H ko {e s+p )H a (x s+r )H kn {e s+r+q ) 



< C|cum[JT3(z s )fl3(e a+ p)fl3(£c s+r .)JT 3 (e s+7 . + g)]| . 

It suffices then to focus on a = ko = 3. There are seven different kinds of 
connected diagrams, which are represented in Figures 1 to 7. We have 



M ai . 



p=-M q=-M 
M M 

n2 

p=-M q=-M 



n—1 n—r 



c 



^ ^ cum{i?3(x s )F 3 (e s+ p)i73(x s+r )i?3(e s+r . + g)} 

=— n+1 s=l— r 
n—1 n—r 

Yl ^xs(phls(q)lsx(r - phxe(r + q) + 



r=— n+1 s=l— r 
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+ lxe{p)lxx{r)lxe{r + q)lee{r + q- phex(r - p)j X£ {q) 

+ lxx( r hse( r + Q- P)lex(r - p)7 xe (r + q) 

+ lle(Phle(lhxx(rhee{r + Q ~ p) 
„2 /V.\„,2 



+ r 7xx( r )%e( r + 1 ~ P)lxe{p)lxe{q) 
+ lle( r ~ P)lle{ r + q)lxe{p)lxe{q) 
+ lle( r ~ P)lle( r + q)lxx{r)"1ee{r + q ~ p)\ 



c 



M 



M 



< ^ E dpi + i) 2( ^ +de_1) E + i) 2( ^ +d£_1) 



p=~M 



q=-M 



(\r -p\ + l)^+^- x (|r + g| + l)*^" 1 

|r|<2Af 



+ ^ (k-p| + i)* ,+ *" 1 (k + ?l + i) << ' +<ie " 1 

2M<|r|<n 



(17) 



C 



M 



M 



+ - E (bi + E (iffi + !) 



d x +d E — 1 



p=-Af 



q=-M 



E (M + ij^^flr + 9 - pI + i) 2 * -1 ^ - pI + i)^+ de - 1 (k + ?l + 1) 



dz+de — 1 



|r|<3A/ 



+ ^ (| r | + l)^-l(| r+9 _p| + l)^-l(| r _ p | + l)*^-l(| r+g | + l)«fc-W.-l 
3M<|r|<n 



(18) 



C 
n 



M M 

E (\r\ + l) 2{2d *- 1] E E (lr + g-pl + 1) 2 ^ 

r|<3AJ p=-M q=-M 



M M ( 

x (| r _p| + l)cfc-Hfc-l(| r+ g| + i)4>-Hfc-l) + E E E (|r| + l) 2(2 ^- 1} 

p=-M q=-M \ZM<\r\<n 

X | r + g-p| + l) 2 (a*- 1 )(| r _p| + l)«*»+*- 1 (| r + g| + l)«*.+*- 1, )l (19) 



AY 



+ _ ^ (| p | + 1)2(^+^-1) (\q\ + l) 2 ^^- 1 ) 

71 p=—M q=-M 



E (M + i) 2 ^- 1 

,\r\<3M 



(\r+q- P \ + l) 2d *- 1 ) + (M+l^^Clr+g-pl+l) 2 * -1 

3M<\r\<n 



(20) 
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c 



M 



M 



+ - E (N + 1) 44 *" 1 E (M + 1) 



da;+d e — 1 



p=-M 



q=-M 



(|^| + 1)2(2^-1) 

,\r\<3M 



(| r + g _p| + l)2(2«fc-D) + £ (| r | + l)2(2^-l) ( | r + g _ p | + 1) 2(2 ( i £ -l) 

3Af<|r|<n 



(21) 



C 



- E dpi + 1)^^" 1 E + 1)^^- 1 



p=-M 



q=-M 



£ (|r + 9 | + l)2W.+«fc-i) 

|r|<3M 



(Ir-pl + l) 2 ^^- 1 ^ + ^ (Ir+gl + lJ^+^-^dr-pl + l) 2 ^- 1 ^- 1 ) 

3M<|r|<n 



(22) 



\ 2d*-l 



„ Af M 

+ ^ E (H + i) 24 '- 1 E E (k+p-91 + i) 2 *- 1 

|r|<3M P =-M q=-M 

x(\r-p\ + l) 2 ^ +d -- 1 )(|r + g| + 1)2(^+^-1) + ^ (|r| + l) 2 

3M<|r|<n 

M M 

X E E (\r+p-q\ + l) 2dE ~ 1 (\r-p\ + l) 2{dx+dE - 1) (\r + q\ + l) 2{dx+d ^ 1) {23) 

p=-M q=-M 

After lengthy but straightforward computations, it is not difficult to see that 

(jfTj) _ Q^ n ~i M 2d *+ 2d £- 1 M 2dx+2dE ~ 1 M dx+dE ) + 0(n~ 1 M Adx ~ 1 M 4dE ~ 1 n 2dx+2di: ~ 1 ) 

( M<Ld x +Ad e -l • 



n 



= 0(n" 1 M 4+dE M 4+,i£ M 2dl+2dE " 1 ) + 0(n" 1 M 4+,iE M 4+4 n 44+4 ' i£ " 3 ) 



n 



+ o(M 4 



idx-l n /rid E -l 2d x +2d E -2 



M 



n 



PJ = 0(n- l M Ux - 1 M M ^) + 0{n- 1 M 2 n (idx+%d ^) 



O I ] + o(M 4 ^" i M 



y n 

Oin^ 1 M 2dx +2de ~ 1 M 2da: +2d,E ~ 1 M Ma: ) + O (n~ 1 M 4da: ~ 1 M 4d,£ ~ 1 n 2dx +2de ~ 1 ) 

/ M&dx+ide-l ' 

O 

V n 



+ 0(M^~ 1 M 4ds ~ 1 7l 2d!,!+2ds ~ 2 ) 



(EU = 0{n- l M dx+d -M dx+d -M id -- 1 ) + 0(n- 1 M 2dx M 2dx n 4dx+M ^- 3 ) 



[ ) + o(M 



4d x - 1 M 4d E -l n 2d x +2d E -2 
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(|22|) = O^" 1 ^/ 4 ^^ 4 ^^ 244 " 24 ^ 1 ) + 0(n^ 1 M 24+2d£ n 4c!l+4d ^ 3 ) 

= O ( ^— ) + o(M Mx ~ 1 M 4de ~ 1 n 2dx+2de ~ 2 ) 

p3|) = 0{n- l M 2d *M 2d * +2ds ) + 0(n~ 1 M 2 n 6dx+6dE ~ 5 ) 

jurAd x +Ad e ~l 

= o(- ) + o(M 4d *" 1 M 4 ' 4 - 1 n M * +2 ' 4 - 2 ) 

n 

Ji/[4:d x +4d £ -l 

= Of- ) + (M idx ~ 1 M' ld£ ~ 1 n 2dx+2d£ ~ 2 ) . 

n 

In view of the previous results, our proof will be completed if we show that 

|_ M 4d x -1 M 4d £ -l n 2d x +2d £ -2 _ < M 2d a +2d e \ 

n ' 

where 

Ida + 2d e = 2ad x + 2k d £ - (a + k ) + 2 . 
We note first that 

^4^+4^-1 Midx+Ade-l j^2(2-a)d x +2(2-k )d £ -3+a+k 

n J\/j2d a +2d e l M 2ad x +2k d e -{a+k )+2\ n 



l M 2d "+ 2d e n (M2ad x +2k d £ -(a+k Q )+2\ 
From (jlljl and d e > it follows that 



11 I T l l 

d x > and d e > — , 

2 2a 2 2k 

whence, because a,l:o>2we have 

M 2(2-a)d x+ 2(2-k )d E -3+a+~k M 2(2- a )(|-i)+2(2-fc )(§-^)~3+a+fc 
< 

n n 

'M 3s 



1? 



— =°(1) > 



in view of Assumption C. To complete the proof, note that, again from 
Assumption C, for some a > a — 2, ko — 2 we have M a = 0(n), whence 

n/r4d x +4d £ -2„2d x +2d E -2 

M _ _ = o ( A f(2-a+a)(2^-l)+(2-fc +a)(2d £ -lK = ag 

M2ad x +2kod e -(a+ko)+2 

Thus the proof is completed. □ 
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X s 

•Es+r 
£ s+r+q 




Figure 4: 7L(p)7L(?)7^(r)7 £e (^ + Q~p) 



x s 

£s+p 



X 



S + r 



'-s+r+q 




v s+r 



-s+p 



- s+r+q 



Figure 5: ll £ {p)l 2 X£ {q)l 2 xx { r )l 2 ee( r + Q~P) 
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Figure 6: lxe(p)lxe(q)ll £ { r + lhex( r - P) 




Figure 7: -f xx {r)^ ££ (r + q- p)"/% e (r + q)j £X (r - p) 
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